Speaker Diarization Based on Gmm Supervectors and Unsupervised Intra-speaker Variability Modeling

نویسنده

Hagai Aronowitz

چکیده

This paper presents a novel framework for speaker diarization. Audio is parameterized by a sequence of GMM-supervectors representing overlapping short segments of speech. Session dependent intra-session intra-speaker variability is estimated online in an unsupervised manner, and is removed from the supervectors using Nuisance Attribute Projection (NAP) The supervectors are then projected using principal component analysis (PCA) into a one-dimensional space (for the two speaker case). Viterbi decoding is used to find an optimal segmentation using the projected supervectors as estimates for log-likelihood ratios. Finally, GMMs are adapted for each assumed speaker and Viterbi re-segmentation is used for producing a final diarization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization

This paper presents a novel framework for unsupervised compensation of intra-session intra-speaker variability in the context of speaker diarization. Audio files are parameterized by sequences of GMM-supervectors representing overlapping short segments of speech. Session-dependent intra-session intra-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisan...

متن کامل

Speaker Diarization using Unsupervised Compensation of Within-Speaker Variability

This paper presents a novel framework for unsupervised compensation of within-speaker variability in the context of speaker diarization. Audio session is divided into overlapping short segments, each one parameterized by a GMM-supervector. For each session independently within-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisance attribute projection ...

متن کامل

ALIZE/spkdet: a state-of-the-art open source software for speaker recognition

This paper presents the ALIZE/SpkDet open source software packages for text independent speaker recognition. This software is based on the well-known UBM/GMM approach. It includes also the latest speaker recognition developments such as Latent Factor Analysis (LFA) and unsupervised adaptation. Discriminant classifiers such as SVM supervectors are also provided, linked with the Nuisance Attribut...

متن کامل

On the use of GSV-SVM for Speaker Diarization and Tracking

In this paper, we present the use of Gaussian Supervectors with Support Vector Machines classifiers (GSV-SVM) in an acoustic speaker diarization and a speaker tracking system, compared with a standard Gaussian Mixture Model system based on adapted Universal Background Models (GMM-UBM). GSVSVM systems (which share the adaptation step with the GMMUBM systems) are observed to have comparable perfo...

متن کامل

SVM Speaker Verification Using Session Variability Modelling and GMM Supervectors

This paper demonstrates that modelling session variability during GMM training can improve the performance of a GMM supervector SVM speaker verification system. Recently, a method of modelling session variability in GMM-UBM systems has led to significant improvements when the training and testing conditions are subject to session effects. In this work, session variability modelling is applied d...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2010

Speaker Diarization Based on Gmm Supervectors and Unsupervised Intra-speaker Variability Modeling

نویسنده

چکیده

منابع مشابه

Unsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization

Speaker Diarization using Unsupervised Compensation of Within-Speaker Variability

ALIZE/spkdet: a state-of-the-art open source software for speaker recognition

On the use of GSV-SVM for Speaker Diarization and Tracking

SVM Speaker Verification Using Session Variability Modelling and GMM Supervectors

عنوان ژورنال:

اشتراک گذاری